Probability and Statistics
Subject: Mathematics
Topic: 9
Cambridge Code: 0580
Probability Fundamentals
Basic Concepts
Experiment - Process producing outcomes Sample space (S) - All possible outcomes Event (A) - Subset of sample space Probability - Likelihood of event, 0 ≤ P(A) ≤ 1
Probability Definition
Assumes equally likely outcomes
Probability Rules
Addition Rule
For any events A and B:
Mutually exclusive (A and B cannot occur together):
Multiplication Rule
For independent events:
For dependent events:
Complement Rule
Conditional Probability
Conditional probability - Probability given another event occurred
Bayes' Theorem
Generalized:
Distributions
Discrete Distributions
Binomial Distribution - n independent trials, probability p
Mean: Variance:
Poisson Distribution - Rare events in fixed time/space
Mean: Variance:
Continuous Distributions
Normal Distribution - Bell curve, symmetric
Mean: μ Standard deviation: σ
Standardization: follows standard normal (μ = 0, σ = 1)
Central Limit Theorem
Distribution of sample means approaches normal as n increases, regardless of parent distribution
Statistics
Measures of Central Tendency
Mean (average):
Median - Middle value when ordered
Mode - Most frequent value
Weighted mean:
Measures of Spread
Range - Maximum - Minimum
Variance:
Standard deviation:
Interquartile range (IQR) -
Standardization
Allows comparison across scales
Sampling
Sample vs Population
Population - Entire group of interest Sample - Subset used for analysis Bias - Systematic error favoring certain values Random sampling - Every element equally likely
Sampling Methods
Simple random - Every possible sample equally likely Stratified - Divide into strata, sample from each Systematic - Select every kth element Cluster - Divide into clusters, sample clusters
Hypothesis Testing
Null and Alternative Hypotheses
Null hypothesis - Status quo claim Alternative hypothesis - Research claim
- One-tailed or two-tailed
Significance Level
α - Probability of Type I error
- α = 0.05 (most common)
- α = 0.01 (more stringent)
Type I and Type II Errors
| H₀ True | H₀ False | |
|---|---|---|
| Reject H₀ | Type I error | Correct |
| Fail to reject H₀ | Correct | Type II error |
p-value
p-value - Probability of observing data if H₀ true
Decision:
- p < α: Reject H₀
- p ≥ α: Fail to reject H₀
Test Statistics
t-test - Compare means, small samples z-test - Compare means, large samples χ² test - Goodness of fit, independence F-test - Variance comparison
Correlation and Regression
Correlation Coefficient
Pearson's r - Measures linear relationship (-1 to 1)
- r = 1: Perfect positive
- r = 0: No linear relationship
- r = -1: Perfect negative
Linear Regression
Least squares line:
Coefficient of determination: - Proportion of variance explained
Data Visualization
Distributions
Frequency histogram - Shows distribution shape Stem-and-leaf - Shows individual values Box plot - Shows quartiles and outliers
Relationships
Scatter plot - Shows correlation Line graph - Shows trends over time
Key Points
- Probability: 0 ≤ P(A) ≤ 1
- Addition rule for unions
- Multiplication for intersections
- Conditional probability changes with information
- Binomial for discrete, count data
- Normal for continuous data
- Central Limit Theorem for sampling
- Mean and standard deviation describe distributions
- Hypothesis testing uses significance level
- Correlation measures relationship strength
Practice Questions
- Calculate probabilities using rules
- Apply Bayes' theorem
- Find binomial probabilities
- Use normal distribution tables
- Calculate sample statistics
- Conduct hypothesis tests
- Find confidence intervals
- Calculate correlation
- Fit regression lines
- Interpret results
Revision Tips
- Know probability rules thoroughly
- Understand normal distribution properties
- Practice hypothesis testing steps
- Understand Type I and Type II errors
- Learn when to use each test
- Interpret correlation correctly
- Practice with real data
- Understand limitations
- Know when normal approximation applies